Download On the use of zero-crossing rate for an apllication of classification of percussive sounds
We address the issue of automatically extracting rhythm descriptors from audio signals, to be eventually used in content-based musical applications such as in the context of MPEG7. Our aim is to approach the comprehension of auditory scenes in raw polyphonic audio signals without preliminary source separation. As a first step towards the automatic extraction of rhythmic structures out of signals taken from the popular music repertoire, we propose an approach for automatically extracting time indexes of occurrences of different percussive timbres in an audio signal. Within this framework, we found that a particular issue lies in the classification of percussive sounds. In this paper, we report on the method currently used to deal with this problem.
Download Rhythmic expressiveness transformations of audio recordings: swing modifications
In this paper, we propose a computer software for modifying rhythmic performances of polyphonic musical audio signals. It first describes the rhythmic content of an audio signal (i.e. determination of tempi and beat indexes at the quarter-note and eighth-note levels, as well as estimation of the swing ratio). Then, the signal is transformed in real-time using a time-stretch algorithm. We present basic techniques provided by commercial products for swing modification and compare these to our system.
Download An Open Source Tool for Semi-Automatic Rhythmic Annotation
We present a plugin implementation for the multi-platform WaveSurfer sound editor. Added functionalities are the semi-automatic extraction of beats at diverse levels of the metrical hierarchy as well as uploading and downloading functionalities to a music metadata database. It is built upon existing open source (GPL-licenced) audio processing tools, namely WaveSurfer, BeatRoot and CLAM, in the intent to expand the scope of those softwares. It is therefore also provided as GPL code with the explicit goal that researchers in the audio processing community can freely use and improve it. We provide technical details of the implementation as well as practical use cases. We also motivate the use of rhythmic metadata in Music Information Retrieval scenarios.
Download Semi-automatic Ambience Generation
Ambiances are background recordings used in audiovisual productions to make listeners feel they are in places like a pub or a farm. Accessing to commercially available atmosphere libraries is a convenient alternative to sending teams to record ambiances yet they limit the creation in different ways. First, they are already mixed, which reduces the flexibility to add, remove individual sounds or change its panning. Secondly, the number of ambient libraries is limited. We propose a semi-automatic system for ambiance generation. The system creates ambiances on demand given text queries by fetching relevant sounds from a large sound effect database and importing them into a sequencer multitrack project. Ambiances of diverse nature can be created easily. Several controls are provided to the users to refine the type of samples and the sound arrangement.